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Crystallization and preliminary X-ray diffraction 
analysis of Nsp15 from SARS coronavirus 


The non-structural protein Nspl5 from the aetiological agent of SARS (severe 
acute respiratory syndrome) has recently been characterized as a uridine- 
specific endoribonuclease. This enzyme plays an essential role in viral 
replication and transcription since a mutation in the related H229E human 
coronavirus nspl5 gene can abolish viral RNA synthesis. SARS full-length 
Nspl5 (346 amino acids) has been cloned and expressed in Escherichia coli with 
an N-terminal hexahistidine tag and has been purified to homogeneity. The 
protein was subsequently crystallized using PEG 8000 or 10 000 as precipitants. 
Small cubic crystals of the apoenzyme were obtained from 100 nl nanodrops. 
They belong to space group P4j32 or P4-32, with unit-cell parameters 
a = b = c = 166.8 A. Diffraction data were collected to a maximum resolution 
of 2.7 A. 


1. Introduction 

An outbreak of severe acute respiratory syndrome (SARS) initiated 
suddenly in China in 2003 and rapidly spread worldwide, causing 
more than 8000 fatalities. The aetiological agent of SARS was char¬ 
acterized as a coronavirus named SARS-CoV. This virus has a 
~30 kbp long single-strand RNA genome with a 5' cap and 3' poly(A) 
tail (Rota et al., 2003). 

The genome of SARS-CoV codes for two large polyproteins, la 
and lab, comprising the replicase components. These polypeptides 
are autoproteolytically processed by two viral proteases, yielding 16 
non-structural proteins (Nsps) which, together with a number of 
processing intermediates and host proteins, are thought to form a 
huge protein complex responsible for viral replication and tran¬ 
scription. Despite the growing interest in coronaviruses, the 
biochemical and structural characterization of the non-structural 
proteins still lags behind. In the last two years, despite tremendous 
efforts worldwide, only three X-ray structures of Nsps were reported: 
those of Nsp5, the main protease, of Nsp9, a novel RNA-binding 
protein, and of the complex Nsp7-Nsp8 (Anand et al., 2003; Egloff et 
al., 2004; Sutton et al., 2004; Zhai et al., 2005). 

A number of unusual activities of coronavirus Nsps have been 
identified that differ from those found in other RNA viruses and 
Nspl5 is a striking example. Comparative genomics has made it 
possible to identify similarity between Nspl5 and only one other 
protein, XendoU, an endoribonuclease from Xenopus laevis studied 
by Caffarelli et al. (1997). XendoU is involved in the processing of 
U16 small nucleolar RNA from its host pre-mRNA. It is poly(U)- 
specific and requires Mn 2+ as a cofactor (Caffarelli et al., 1997; Laneve 
et al., 2003). It has been suggested, therefore, that Nspl5 might be an 
endoribonuclease (Snijder et al., 2003), which is unique to nidoviruses 
within the RNA viral world. 

The biochemical characterization of SARS Nspl5 has recently 
been reported and its endoribonucleolytic activity as well as its 
specificity have been determined (Bhardwaj et al., 2004; Ivanov et al., 
2004). The main activity of Nspl5 is to cleave 5' to uridylates in 
double-stranded RNA, which requires the presence of divalent 
cations. Like other endoribonucleases, Nspl5 uses Mn 2+ as cofactor; 
in the presence of Mg 2+ or other metal cations lower or no activity is 
detected. Nspl5 has been shown to be highly specific for RNA. DNA 
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is not cleaved and does not inhibit RNA digestion, thus indicating 
that it does not bind to Nspl5 (Bhardwaj et al., 2004). Single-stranded 
RNA is also cleaved, but less specifically and efficiently than double- 
stranded RNA (Ivanov et al. , 2004). Electron microscopy as well as 
other biophysical techniques have revealed that Nspl5 monomers 
form hexameric structures (Guarino et al., 2005). 

RNases are divided into two classes (Saida et al. , 2003). The first 
group of enzymes is characterized by the presence of a metal ion in 
the active site. In the second group, the reaction is performed using a 
histidine as a general base and a histidine or a glutamic acid as a 
general acid (Irie et al., 1997; Nishikawa et al, 1987; Steyaert et al., 
1990; Thompson et al., 1994). The reaction generates a product 
carrying a cyclic 2'-3' phosphodiester terminus. Recent work (Gioia et 
al., 2005) has suggested that XendoU and the viral endoribonucleases 
related to it do not belong to either of the previously described 
groups. Instead, this work reveals the existence of a new class of 
RNA-processing enzymes that require a metal ion for activity (mainly 
Mn 2+ ), produce cyclic 2'-3' phosphodiester termini and have two 
histidine residues that are essential for RNase activity. 

The biochemical novelty of Nspl5 and the great medical impor¬ 
tance of SARS-CoV prompted us to initiate the crystallographic 
study of this enzyme. Full-length Nspl5 consists of a polypeptide 
chain of 346 amino acids and has a molecular weight of 38.5 kDa. In 
the present communication, we report the cloning, expression, puri¬ 
fication and crystallization of Nspl5 from SARS-CoV and preli¬ 
minary diffraction experiments on Nspl5 crystals. 


2. Expression and purification 

The SARS-CoV (Frankfurt-1 strain; NCBI accession No. AY291315) 
Nspl5 full coding sequence (amino acids 6430-6775 of polyprotein 
lab) was amplified by PCR from cDNA kindly provided by J. Dobbe 
and E. J. Snijder (Drosten et al, 2003) using two primers containing 
the attB sites of the Gateway recombination system (Invitrogen). A 
sequence encoding a hexahistidine tag was added at the 5' end of the 
gene. The cDNA was cloned in the pDestl4 plasmid according to 
Gateway cloning technology (Invitrogen). Multiparameter expres¬ 
sion screening in Escherichia coli was performed using a sparse 
matrix (Abergel et al, 2003), followed by the dot blot detection 
procedure (Vincentelli et al, 2005). The optimized expression 
condition used E. coli strain C41(DE3) (Avidis) transformed with the 
pLysS plasmid (Novagen) and grown in Turbo Broth media (Athena 
Enzymes) at 298 K. Overnight induction was performed with 0.5 m M 
IPTG when OD 6(K) reached 0.5. Expression yielded about 1.5 mg of 
soluble Nspl5 per litre of culture. The cells were harvested by 
centrifugation and resuspended in 10 ml of buffer A (50 mil Tris pH 
8, 150 mil NaCl, 10 mil imidazole, 1 mil PMSF, 0.2 mg ml -1 lyso¬ 
zyme, 0.1 pg ml 1 DNase, 20 mil MgS0 4 and protease-cocktail 
inhibitor; Sigma) per OD 6()() unit and per litre of culture. Cells were 
lysed by sonication and centrifuged (20 OOOg) for 40 min at 277 K. 
The purification consisted of two steps. Firstly, IMAC (immobilized 
metal-ion affinity chromatography) was performed on a Hisprep 
column (Amersham). The protein was eluted with 50 mil Tris-HCl, 
150 m M NaCl, 250 mil imidazole pH 8.0. Size-exclusion chromato¬ 
graphy was performed using a Superdex 200 HiLoad 26/60 (Amer¬ 
sham) in 10 mill Tris, 50 mil NaCl pH 7.5. Nspl5 was eluted with a 
retention volume corresponding to a molecular weight of about 
250 kDa, corresponding to a hexameric association, as also observed 
by Guarino et al. (2005). Protein purity was analyzed by SDS-PAGE. 
The recombinant protein was concentrated using Amicon Ultra 
(5000 kDa molecular-weight cutoff). 


3. Crystallization 

The protein was unstable in the buffer used for size-exclusion chro¬ 
matography and could not be concentrated beyond 2-3.5 mg ml -1 . 
Moreover, the protein had to be crystallized soon after purification to 
prevent precipitation. Microdialysis experiments indicated that the 
protein was more stable (i) at pH values between 6.5 and 9 and (ii) 
when the sodium chloride concentration was higher than 300 mil. 
However, crystallization failed under these conditions and the 
sodium chloride concentration had to be decreased, the final protein 
buffer being 50 mil NaCl, 10 mil Tris pH 7.5. 

Crystallization trials were performed using the sitting-drop tech¬ 
nique at 291 K using a nanodrop-dispensing robot (Cartesian Inc.). 
Wizard Screens 1 and 2 (Emerald Biostructures), Structure Screens 1 
and 2 and Stura Footprint Screen (Molecular Dimensions Ltd) were 
used for initial screening. Drops of 300, 200 and 100 nl protein 
solution (1.0-3.5 mg ml -1 ) were mixed with 100 nl mother liquor (the 
reservoir volume was 150 pi). Three conditions from the Stura 
Footprint Screen and one from the Wizard Screens yielded crystal 
hits. After optimization, two of these conditions led to small cubic 
crystals [10-20%(w/v) polyethylene glycol 10 000, 0.2 M imidazole 
malate pH 8.0-9.0 and 15-25%(w/v) polyethylene glycol 8000, 0.1 M 
CHES pH 9.0-10.0], The crystals grew in 2-4 d to maximum dimen¬ 
sions of 0.05 x 0.05 x 0.05 mm (Fig. 1). 


4. Preliminary crystallographic analysis 

The crystals of Nspl5 were flash-frozen in liquid nitrogen at 110 K. 
Glycerol was added to the mother liquor as a cryoprotectant to a final 
concentration of 33%. Diffraction data for the native enzyme were 
collected at the European Synchrotron Radiation Facility (ESRF, 
Grenoble, France) beamline ID14-3 using a MAR 165 CCD detector. 
The data were processed using MOSFLMISCALA (Collaborative 
Computational Project, Number 4, 1994). 

Nspl5 crystals diffracted X-rays to a resolution limit of 2.7 A. 
Crystals presented cubic symmetry, belonging to one of the two 
enantiomorphic space groups P4i32 or P4 3 32, with unit-cell para¬ 
meters a = b = c = 166.8 A (Table 1). Calculation of the Matthews 
coefficient indicated that the asymmetric unit may contain either one 
or two molecules of Nspl5, which would correspond to a calculated 
solvent content of 75 or 51%, respectively (Matthews, 1968). No peak 
was identified when searching for a twofold axis in the self-rotation 
map calculated using the program GLRF (Tong & Rossmann, 1997), 
thus suggesting that the asymmetric unit contains one molecule. No 



Figure 1 

Crystals of Nspl5 from SARS-CoV crystallized as the apoenzyme. 
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Table 1 

X-ray diffraction data statistics. 

Values in parentheses are for the highest resolution shell (2.85-2.70 A). 


Data collection 

ID14-3, ESRF 

Wavelength (nm) 

0.931 

Resolution (A) 

59-2.70 

Total observations 

268080 

Unique observations 

39242 

Completeness (%) 

100 (100) 

Redundancy 

11.9 (12.2) 

Average I/cr(I) 

14.5 (4.4) 

Wilson B factor 

50.2 

Rm=r E et (%) 

17.4 (60.5) 


t iW = Em, E, 1^ - W I/Em, E, I- 


structure with sufficient sequence similarity to Nspl5 is currently 
available; structure determination using selenomethionine- 
substituted protein for MAD/SAD phasing is thus necessary and has 
been initiated. 
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